Picture for Wenhan Yu

Wenhan Yu

ESPO: Early-Stopping Proximal Policy Optimization

Add code
May 28, 2026
Viaarxiv icon

Harness-Bench: Measuring Harness Effects across Models in Realistic Agent Workflows

Add code
May 27, 2026
Viaarxiv icon

MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection

Add code
May 22, 2026
Viaarxiv icon

TriPlay-RL: Tri-Role Self-Play Reinforcement Learning for LLM Safety Alignment

Add code
Jan 26, 2026
Viaarxiv icon

EntroCoT: Enhancing Chain-of-Thought via Adaptive Entropy-Guided Segmentation

Add code
Jan 08, 2026
Viaarxiv icon

MCPAgentBench: A Real-world Task Benchmark for Evaluating LLM Agent MCP Tool Use

Add code
Dec 31, 2025
Viaarxiv icon

BBox DocVQA: A Large Scale Bounding Box Grounded Dataset for Enhancing Reasoning in Document Visual Question Answer

Add code
Nov 19, 2025
Viaarxiv icon

Benchmarking Multi-Step Legal Reasoning and Analyzing Chain-of-Thought Effects in Large Language Models

Add code
Nov 19, 2025
Figure 1 for Benchmarking Multi-Step Legal Reasoning and Analyzing Chain-of-Thought Effects in Large Language Models
Figure 2 for Benchmarking Multi-Step Legal Reasoning and Analyzing Chain-of-Thought Effects in Large Language Models
Figure 3 for Benchmarking Multi-Step Legal Reasoning and Analyzing Chain-of-Thought Effects in Large Language Models
Figure 4 for Benchmarking Multi-Step Legal Reasoning and Analyzing Chain-of-Thought Effects in Large Language Models
Viaarxiv icon

Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning

Add code
Feb 06, 2025
Figure 1 for Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning
Figure 2 for Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning
Figure 3 for Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning
Figure 4 for Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning
Viaarxiv icon

Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation

Add code
Apr 19, 2024
Figure 1 for Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation
Figure 2 for Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation
Figure 3 for Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation
Figure 4 for Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation
Viaarxiv icon